PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Sopen01g036520.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family bHLH
Protein Properties Length: 464aa    MW: 50619.7 Da    PI: 7.3278
Description bHLH family protein
Gene Model
Gene Model ID     Type    Source
Sopen01g036520.1  genome  spenn
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    HLH     49.1   1e-15    288    333  5          55
                       HHHHHHHHHHHHHHHHHHHHCTSCCC...TTS-STCHHHHHHHHHHHHHHH CS
               HLH   5 hnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq 55 
                       hn+ Er+RRd+iN+++ +L++l+P++      K +Ka++L +++eY+k+Lq
  Sopen01g036520.1 288 HNQSERKRRDKINQRMKTLQKLVPNS-----SKTDKASMLDEVIEYLKQLQ 333
                       9************************8.....7******************9 PP
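
The coordinates around the alignment can be cross-checked against the aligned rows themselves. The following is a minimal sketch, assuming the start and end values are 1-based and inclusive and that `-` and `.` mark gaps; the helper names `ungapped_length` and `count_identities` are illustrative and not part of PlantTFDB or HMMER.

```python
# Rows copied from the alignment above.
# Assumption: coordinates are 1-based and inclusive.
HMM_ROW = "hnerErrRRdriNsafeeLrellPkaskapskKlsKaeiLekAveYIksLq"
HMM_START, HMM_END = 5, 55
SEQ_ROW = "HNQSERKRRDKINQRMKTLQKLVPNS-----SKTDKASMLDEVIEYLKQLQ"
SEQ_START, SEQ_END = 288, 333


def ungapped_length(row: str) -> int:
    """Count residues in an alignment row, ignoring '-' and '.' gap characters."""
    return sum(1 for ch in row if ch not in "-.")


def count_identities(hmm_row: str, seq_row: str) -> int:
    """Count columns where the target residue matches the HMM consensus (case-insensitive)."""
    return sum(
        1
        for h, s in zip(hmm_row, seq_row)
        if s not in "-." and h.upper() == s.upper()
    )


hmm_len = ungapped_length(HMM_ROW)
seq_len = ungapped_length(SEQ_ROW)
gap_columns = SEQ_ROW.count("-")
identities = count_identities(HMM_ROW, SEQ_ROW)

print(f"HMM positions covered:    {hmm_len} (expected {HMM_END - HMM_START + 1})")
print(f"Target residues covered:  {seq_len} (expected {SEQ_END - SEQ_START + 1})")
print(f"Gap columns in target:    {gap_columns}")
print(f"Identical columns:        {identities}")
```

The five gap columns account for the difference between the 51 HMM positions and the 46 target residues.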

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End  InterPro ID  Description
SuperFamily      SSF47459           3.27E-19  281    350  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
PROSITE profile  PS50888            17.815    283    332  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
CDD              cd00083            6.91E-17  288    337  No hit       No description
Gene3D           G3DSA:4.10.280.10  2.2E-18   288    342  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
Pfam             PF00010            2.7E-13   288    333  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
SMART            SM00353            3.2E-18   289    338  IPR011598    Myc-type, basic helix-loop-helix (bHLH) domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0009567  Biological Process  double fertilization forming a zygote and endosperm
GO:0009506  Cellular Component  plasmodesma
GO:0046983  Molecular Function  protein dimerization activity
Sequence
Protein Sequence    Length: 464 aa
MNQCVPSWDL DDSTVPRKNP IQTQSNSLAA DVPSLDYEVA ELTWENGQLA MHGLGPPRAN  60
NKPISSYGGT LESIVNQATR CNDDVPLHLH GKSTVDRNKQ GGDEVVPWFN NHNAVAYAPP  120
ATGLVAMTKD ALVPCSRNTS NSDNHRSVHV PGIDGSTHVG SCSGATNSRD WMVAPRMRVR  180
PTRREWSSRA DMISVSGSET CGGDSRQLTV DTFDREFGTT MYTSTSMGSP ENTSSDKQCT  240
NRTGDDHDSV CHIRDQKEGG DDEDDNNNKK GSKNSSSSTK RKRAAAIHNQ SERKRRDKIN  300
QRMKTLQKLV PNSSKTDKAS MLDEVIEYLK QLQAQVHMMS RMNMSPAMML PLAMQQQLQM  360
SMMGMGMGMG MGMGVAGVFD INNLSRPNIP GLPSFLHPSA AFMQPITSWD NSNSAPSPPS  420
AAMPDPLAAL LACQSQPINM DAYSRMAALY QQFQQPPTGS GPKN
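
The printed block groups residues in tens and ends each full line with a running position counter, so it needs light cleaning before use. The sketch below strips everything that is not a letter, checks the stated length of 464 aa, and slices out the 288 to 333 span shown in the signature domain alignment. It assumes those coordinates are 1-based and inclusive; `parse_sequence_block` and `slice_1based` are hypothetical helper names.

```python
import re

# The sequence block as printed above: groups of 10 residues,
# with a running position counter at the end of each full line.
SEQUENCE_BLOCK = """
MNQCVPSWDL DDSTVPRKNP IQTQSNSLAA DVPSLDYEVA ELTWENGQLA MHGLGPPRAN  60
NKPISSYGGT LESIVNQATR CNDDVPLHLH GKSTVDRNKQ GGDEVVPWFN NHNAVAYAPP  120
ATGLVAMTKD ALVPCSRNTS NSDNHRSVHV PGIDGSTHVG SCSGATNSRD WMVAPRMRVR  180
PTRREWSSRA DMISVSGSET CGGDSRQLTV DTFDREFGTT MYTSTSMGSP ENTSSDKQCT  240
NRTGDDHDSV CHIRDQKEGG DDEDDNNNKK GSKNSSSSTK RKRAAAIHNQ SERKRRDKIN  300
QRMKTLQKLV PNSSKTDKAS MLDEVIEYLK QLQAQVHMMS RMNMSPAMML PLAMQQQLQM  360
SMMGMGMGMG MGMGVAGVFD INNLSRPNIP GLPSFLHPSA AFMQPITSWD NSNSAPSPPS  420
AAMPDPLAAL LACQSQPINM DAYSRMAALY QQFQQPPTGS GPKN
"""


def parse_sequence_block(block: str) -> str:
    """Remove spaces, newlines, and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both bounds as 1-based and inclusive."""
    return seq[start - 1:end]


protein = parse_sequence_block(SEQUENCE_BLOCK)
hlh_region = slice_1based(protein, 288, 333)

print(f"Parsed length: {len(protein)} aa")
print(f"Residues 288-333: {hlh_region}")
```

The sliced region can then be compared with the ungapped target row of the domain alignment.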
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    280    295  RKRAAAIHNQSERKRR
2    291    296  ERKRRD
3    292    297  RKRRDK
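
When these coordinates are compared with the numbered protein sequence, each listed start and end falls one position before the 1-based location of the motif. This suggests that this table is zero-based, whereas the signature domain hit matches the sequence with 1-based coordinates. That is an observation from this record only, not a documented convention. The sketch below reproduces the check on residues 241 to 300; the `locate` helper and the constant names are illustrative.

```python
# Residues 241-300, copied from the fifth line of the sequence block
# (1-based numbering, as given by the running counters in that block).
SEGMENT_START = 241
SEGMENT = "NRTGDDHDSVCHIRDQKEGGDDEDDNNNKKGSKNSSSSTKRKRAAAIHNQSERKRRDKIN"

# (listed start, listed end, motif) exactly as given in the NLS table.
NLS_ROWS = [
    (280, 295, "RKRAAAIHNQSERKRR"),
    (291, 296, "ERKRRD"),
    (292, 297, "RKRRDK"),
]


def locate(motif: str) -> tuple[int, int]:
    """Return the 1-based inclusive (start, end) of the first match in the segment."""
    idx = SEGMENT.find(motif)
    if idx < 0:
        raise ValueError(f"motif {motif!r} not found in residues 241-300")
    start = SEGMENT_START + idx
    return start, start + len(motif) - 1


# Difference between the 1-based position and the listed position, per row.
offsets = []
for listed_start, listed_end, motif in NLS_ROWS:
    start, end = locate(motif)
    offsets.append((start - listed_start, end - listed_end))
    print(f"{motif}: listed {listed_start}-{listed_end}, 1-based {start}-{end}")
```

A constant offset of one on both ends is what a zero-based, end-inclusive numbering would produce.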
Binding Motif
Motif ID  Method  Source
MP00034   PBM     Transfer from AT4G00050
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975440  0.0      HG975440.1 Solanum pennellii chromosome ch01, complete genome.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_015062984.1      0.0      PREDICTED: transcription factor UNE10 isoform X2
TrEMBL  K4AZ20              0.0      K4AZ20_SOLLC; Uncharacterized protein
STRING  Solyc01g090790.2.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
AsteridsOGEA43672132
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00050.1  2e-82    bHLH family protein